feat: webvtt and source tracker #2787
✅ DCO Check Passed
Thanks @ceberam, all your commits are properly signed off. 🎉
Merge Protections
Your pull request matches the following merge protections and will not be merged until they are valid.
🟢 Enforce conventional commit
Wonderful, this rule succeeded. Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
🟢 Require two reviewer for test updates
Wonderful, this rule succeeded. When test data is updated, we require two reviewers.
…em and ProvenanceTrack Signed-off-by: Cesar Berrospi Ramis <[email protected]>
…classes Signed-off-by: Cesar Berrospi Ramis <[email protected]>
Drop WebVTT formatting features not covered by Docling across formats. Only the 'u', 'b', 'i', and 'v' tags are supported, and without classes. Align with docling-core v2.62.0.
Signed-off-by: Cesar Berrospi Ramis <[email protected]>
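To illustrate the restriction described in this commit message, here is a minimal sketch of a cue-text filter that keeps only the `u`, `b`, `i`, and `v` tags and drops any classes. It is not the code from this PR: `simplify_cue_text`, `SUPPORTED_TAGS`, and the regular expression are hypothetical names chosen for the example, and timestamp tags such as `<00:00:01.000>` are deliberately left untouched.

```python
import re

# Tags kept, per the commit message; every other cue tag is dropped.
SUPPORTED_TAGS = {"u", "b", "i", "v"}

# Matches a WebVTT cue start or end tag such as <b>, </b>, <v.loud Mary>, <c.yellow>.
# Groups: closing slash, tag name, dotted classes, optional annotation.
_TAG_RE = re.compile(r"<(/?)([A-Za-z]+)((?:\.[^\s.>]+)*)([ \t][^>]*)?>")


def simplify_cue_text(payload: str) -> str:
    """Keep only supported tags (hypothetical helper), stripping their classes."""

    def _replace(match: re.Match) -> str:
        closing, name, _classes, annotation = match.groups()
        if name not in SUPPORTED_TAGS:
            return ""  # unsupported tag: drop the markup, keep the inner text
        if closing:
            return f"</{name}>"
        if name == "v" and annotation and annotation.strip():
            # The voice tag keeps its annotation (the speaker name).
            return f"<v {annotation.strip()}>"
        return f"<{name}>"

    return _TAG_RE.sub(_replace, payload)
```

For example, `simplify_cue_text("<v.loud Mary>Hello <c.yellow>big</c> <b.x>world</b></v>")` returns `"<v Mary>Hello big <b>world</b></v>"`.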
dolfim-ibm
left a comment
lgtm
Refactoring of the WebVTT backend parser and the ASR pipeline to align with the latest changes in docling-core.
- … `source` field in `DoclingDocument` (type `TrackSource`).
- … `source` field too, and thus the text is separated from the metadata (timings and speaker).

Resolves #2564
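As a rough illustration of what "text separated from the metadata (timings and speaker)" can look like, the sketch below splits one WebVTT cue into plain text plus a small metadata record. Everything here is an assumption made for the example: `CueTrack` and its field names are a hypothetical stand-in and do not mirror docling-core's `TrackSource`, and `split_cue` is not a function from this PR.

```python
import re
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class CueTrack:
    """Hypothetical metadata record; NOT docling-core's TrackSource."""
    start_time: float  # seconds
    end_time: float  # seconds
    voice: Optional[str] = None  # speaker taken from a <v ...> tag, if any


# WebVTT timestamp: optional hours, then mm:ss.ttt
_TIMING_RE = re.compile(r"(?:(\d+):)?(\d{2}):(\d{2})\.(\d{3})")
# Voice tag with optional classes and a speaker annotation, e.g. <v.loud Mary>
_VOICE_RE = re.compile(r"<v(?:\.[^\s>]+)?[ \t]+([^>]+)>")


def _to_seconds(stamp: str) -> float:
    match = _TIMING_RE.fullmatch(stamp)
    if match is None:
        raise ValueError(f"not a WebVTT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = match.groups()
    return int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds) + int(millis) / 1000


def split_cue(timing_line: str, payload: str) -> Tuple[str, CueTrack]:
    """Return (plain text, metadata) for one cue; cue settings are ignored."""
    start, _, rest = timing_line.partition("-->")
    end = rest.split()[0]  # drop trailing cue settings such as "align:start"
    voice_match = _VOICE_RE.search(payload)
    voice = voice_match.group(1).strip() if voice_match else None
    text = re.sub(r"<[^>]+>", "", payload).strip()  # strip all cue markup
    return text, CueTrack(_to_seconds(start.strip()), _to_seconds(end), voice)
```

With this sketch, `split_cue("00:00:04.500 --> 00:00:06.250 align:start", "<v Mary>Hello there</v>")` yields the text `"Hello there"` and `CueTrack(start_time=4.5, end_time=6.25, voice="Mary")`, so the text can be stored on its own while timings and speaker travel alongside it.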